Strong nonlocality: A trade-off between states and measurements 
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Measurements on entangled quantum states can produce outcomes that are nonlocally correlated. 
But according to Tsirelson's theorem, there is a quantitative limit on quantum nonlocality. It is 
interesting to explore what would happen if Tsirelson's bound were violated. To this end, we consider 
a model that allows arbitrary nonlocal correlations, colloquially referred to as "box world" . We show 
that while box world allows more highly entangled states than quantum theory, measurements in 
box world are rather limited. As a consequence there is no entanglement swapping, teleportation or 
dense coding. 
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I. INTRODUCTION 

Despite its great explanatory and predictive power, the 
standard formalism of quantum theory - in which states 
are represented by vectors in a complex Hilbert space - 
retains an abstract mathematical character. Of course 
there is no reason why Nature should not be described 
by an abstract mathematical formalism. But it is no- 
table that in quantum theory textbooks, the formalism 
is simply postulated. It is not, say, derived from a small 
set of elementary physical considerations in the manner 
of special relativity. This invites the question; why that 
structure, as opposed to any other? 

One way to approach this question, or at least gain 
some insight, is to compare and contrast quantum the- 
ory with other models - theoretical possibilities which 
do not describe our universe, but which can nonetheless 
be explored. In this paper, we investigate one particu- 
lar non-classical, non-quantum theory. In [l| , this theory 
was called generalized non-signalling theory, or GNST for 
short, as it admits all non-signalling correlations 0, 
Here we call it box world. 

One of the notable features of box world is that, as in 
quantum theory, measurements on separate but entan- 
gled systems can produce outcomes that are nonlocally 
correlated, i.e., which cannot be explained by any local 
hidden- variable model [1, @. It is already known that 
nonlocal correlations are useful in many information the- 
oretic tasks [1, 01 • But in quantum theory, there is a 
quantitative limit on the amount of nonlocality that cor- 
relations can have [8]. In box world, by contrast, arbi- 
trary nonlocal correlations can be produced, as long as 
they do not permit instantaneous signalling. This has 
consequences. For example, with stronger than quantum 
correlations, it is known that communication complexity 
problems become trivial, requiring only constant com- 
munication [loj . On the other hand, as we show in 
this paper, possibilities for measurement in box world 
are in some ways rather limited. It turns out that there 
is nothing analogous to a Bell measurement. We also 
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show that there is no entanglement swapping, teleporta- 
tion, or dense coding. This extends to the whole of box 
world previous special-case results in [ij, |Tl| . 

As entanglement swapping is possible in the quantum 
world [l^, it follows that box world does not contain 
quantum theory as a special case, and that it cannot be 
the true theory describing reality. 



II. A FRAMEWORK FOR PROBABILISTIC 
THEORIES 

In order to compare classical theories, quantum theo- 
ries, and alternatives such as box world, we need a com- 
mon mathematical framework in which the different the- 
ories can be written down. We begin by describing such 
a framework. It is operational in flavour. This means, 
for example, that system, preparation and measurement 
are all taken as basic terms. Different ways of prepar- 
ing a system will prepare different states. The state of 
a system determines the outcome probabilities for any 
measurement that can be performed on the system. The 
framework we describe and the notation we use is closely 
based on that of [l^ and However, we should note 
that there is nothing very novel in the framework itself, 
and that numerous formalisms have been developed over 
the years, intended to be operational generalizations of 
classical and quantum theory (see e.g., [l3l - [l6| ). 



A. Single systems 

One way of specifying the state of a system would be 
to give an exhaustive list of the outcome probabilities for 
every possible measurement. However, as Hardy points 
out [l3|, most physical theories have enough structure 
that it is not necessary to give such an exhaustive list in 
order to specify the state fully. Instead, for each type of 
system, assume that its state can be completely charac- 
terised by the outcome probabilities for some finite subset 
of all possible measurements. Following Hardy, we refer 
to the subset chosen to represent the state as fiducial 
measurements. If the outcome probabilities for the fidu- 
cial measurements are known, then the state is known. 
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and the outcome probabilities for any other measurement 
can be inferred. For example, the state of a single qubit 
in quantum theory corresponds to a density operator on 
a 2-dimensional Hilbert space. But equally, it can be 
completely characterised by the outcome probabilities of 
measurements corresponding to the three Pauli opera- 
tors, Ux, <Jy and Uz- Note that the choice of which mea- 
surements to use as fiducial measurements is not unique. 

A convenient way of writing down the state is as a vec- 
tor P, with components P{a\x), where this is the proba- 
bility of obtaining outcome a when fiducial measurement 
X is performed. Obviously, P{a\x) must be positive and 
normalised such that X]a^('^N) ~ 1- ^^^i' example, for 
binary valued a and x: 



' P(1|0) ' 
P(0|1) 

\pmJ 



(1) 



Within a particular operational model, it is not neces- 
sarily the case that any vector P represents an allowed 
state (that is, a state which can actually be prepared). 
In quantum theory there is no qubit state that assigns 
probability 1 to the -1-1 outcome for all three Pauli mea- 
surements. An operational model must specify, for each 
type of system, a set V of states which can physically be 
prepared. We assume that arbitrary probabilistic mix- 
tures of states can be prepared (e.g., by tossing some 
coins and preparing either state P or state Q depending 
on the result). Hence V is convex. 



B. Multi-partite systems 

Most of the interesting questions that can be asked in 
information theory involve more than one system. So 
how should we describe the state of a multi-partite sys- 
tem in a general operational model? Consider n sys- 
tems, Ai, . . . , An, with a set of fiducial measurements for 
each. If a fiducial measurement xi is performed on Ai, 
X2 on A2, and so on, then at the least, the joint state of 
Ai, . . . , An should determine a joint probability for each 
combination of outcomes. We assume something further: 
a specification of the joint probability of each combina- 
tion of outcomes for each possible combination of fidu- 
cial measurements is sufficient to determine completely 
the joint state. Note that this property does indeed hold 
in both quantum theory and classical probability the- 
ory. But it is not trivial. For example, in an alternative 
quantum theory, formulated using real instead of com- 
plex Hilbert spaces, the property does not hold [1343 ■ 

It is convenient to write a multi-partite state of n sys- 
tems in the form of an 71-dimensional array, with entries 
P(a|x), where x — {xi, . . . ,Xn) specifies a fiducial mea- 
surement for each subsystem, a is a list of the correspond- 
ing measurement outcomes, and P(a|x) is the probabil- 
ity of getting a given x. For example, for two systems. 



each with a binary measurement choice x, and binary 
outcomes a: 



( P(00|00) P(01|00) 
P(10|00) P(11|00) 



P(00|10) P(01|10) 
VP(10|10) P(11|10) 



P(00|01) P(01|01) \ 
P(10|01) P(ll|01) 



P(00|11) P(01|ll) 
P(10|ll) P(ll|ll) / 



(2) 



Conditions of positivity and normalisation apply as be- 
fore. 

Finally, all multi-partite states must satisfy the no- 
signalling conditions: 

P(a|x) is independent of x„ for all n. 

These ensure that separated parties cannot send mes- 
sages to one another simply by making measurements on 
their subsystems. Arguably, if the no-signalling condi- 
tions do not hold, then we had no right to be speaking 
of separate subsystems in the first place. 

A multi-partite state is a product state if P satisfies 

P(a|x) = Pi(ai|a;i)P2(a2|a;2) . . . P„(a„|a;„), (3) 

where Pi{ai\xi) is a valid state of the ith system. A state 
is separable if it can be written as a convex combination 
of product states, otherwise it is entangled. 

C. Measurements 

In general, the fiducial measurements will not be the 
only measurements that one can perform on a system. 
For example, on a single qubit, there is a measurement 
corresponding to (T450 = l/y/2{ax + Cz). On two qubits 
there is a Bell measurement. By the definition of the fidu- 
cial measurements, it must be possible to derive the mea- 
surement probabilities for all such measurements from 
the P(a|x). 

In fact, by considering mixtures of states, it can be 
shown that the probability Pr(r) of obtaining a particular 
outcome r in any measurement must be a linear function 
of the fiducial measurement probabilities P, [l^ . We can 
therefore associate an effect R,. with each measurement 
outcome, where is an array with components i?r(a|x), 
such that 



Pr(r) = Rr • P = ^ P.r(a|x)P(a|x). 



(4) 



A measurement with various possible outcomes is associ- 
ated with a set {R^}- For example, consider again a qubit 
in quantum theory, with fiducial measurements chosen to 
be CTx , cTj/ , ■ The measurement corresponding to cr45o is 
represented by the 1-dimensional arrays 



R4 



2-1 
2-1 



2- 



R 



2-1 
2-1 

2-i 



(5) 
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Note that the array Rr associated with a measurement 
outcome is not in general unique, since there may be a 
different array satisfying Rr-P = R;-PVP€'P. 

A particular operational model must contain a speci- 
fication of the set of measurements that can physically 
be performed on a particular type of system. There is a 
constraint: any measurement must correspond to a set 
{Rr} such that 



Rr • p > wyp eV 



and 



1 VPs P. 



(6) 



(7) 



Furthermore, if a measurement is performed on one sub- 
system of a bipartite system, it is possible to calculate 
the subsequent ("collapsed") state of the other subsys- 
tem, conditioned on a particular outcome. Clearly this 
should be an allowed state of that subsystem. 



D. Dynamics 

In addition to preparations and measurements, it may 
be possible to perform transformations on a system, i.e., 
to act on it in such a way that the system is preserved 
but its state changes. In the most general case, a system 
can change into a system of a different type. But here 
we consider only transformations that preserve the type 
of system. Such a transformation can be represented as 
a map T : V ^ P. As with measurements, a consider- 
ation of mixed states implies that T is linear. Thus a 
transformation can be represented by an array such that 



P'(a'|x') = E^(^>''^l^)^(^l^)- 



(8) 



For each type of system, an operational model should 
specify a set of physically possible transformations. A 
valid transformation should satisfy 



T(P) evyp eV. 



(9) 



There are other consistency conditions that the sets of al- 
lowed states, measurements and transformations should 
satisfy, which we will not go into in detail. For example, 
if a transformation is followed by a fiducial measurement, 
then this process taken as a whole should correspond to 
a valid measurement on the initial state. 



III. BOX WORLD 



1. Positivity: P(a|x) > 

2. Normalisation: |P| = Ea-^ = 1 

3. No-signalling: -P(a|x) is independent of a;„ 

is an allowed state. (2^ 

Two subtleties: first, the normalisation condition is 
only stated for the measurement choice x = 0. But the 
no-signalling conditions are then sufficient to ensure nor- 
malisation for all measurement choices since they imply 



EP(a|x)=EP(a|x'). 



(10) 



Second, although the no-signalling conditions refer only 
to fiducial measurements, it can be shown that they are 
sufficient to prevent signalling using any kind of measure- 
ment flj. 

Box world permits many states that do not have coun- 
terparts in quantum theory. An interesting example is 
this bipartite state: 



PpR(aia2\xiX2) 



i : fli + a2 = X1X2 (mod 2) 
: otherwise 



(11) 

where xi, 0:2, ai, 02 € {0,1}. The correlations generated 
by performing fiducial measurements on this state are 
nonlocal, meaning that they violate the Clauser-Horne- 
Shimony-Holt (CHSH) inequality [5]. In fact, they are 
more nonlocal than is possible in quantum theory, be- 
cause they return a value of 4 for the CHSH expression. 
Tsirelson's theorem Q shows that quantum correlations 
always return a value < 2 a/2, and CHSH showed that 
local correlations always return a value < 2. These su- 
perquantum nonlocal correlations have appeared in the 
literature before 0, Q , and they are sometimes referred 
to as a Popescu-Rohrlich (PR) box. Thus we refer to this 
state as the PR box state. 

What are the allowed measurements in box world? The 
model is defined so that any set {Rr} satisfying condi- 
tions ^ and ^ above corresponds to a physically pos- 
sible measurement. In the following, we explore what 
kinds of measurement this actually allows, and the con- 
sequences for information processing. Intuitively, the fact 
that conditions (O and ([7]) must be satisfied means that 
there is a tradeoff between states and measurements. If 
there is a larger space of states, then there is a smaller set 
of measurements that are compatible with those states. 
The space of states in box world is in an obvious sense 
maximal, so one would expect the possibilities for mea- 
surement to be less interesting than in, say, quantum 
theory. This is indeed the case. 



Box world is one particular operational model, which 
has a natural definition in terms of the framework defined 
above, and which it is interesting to compare with the 
quantum and classical theories. Box world is defined as 
follows. Any P satisfying: 



IV. MEASUREMENTS IN BOX WORLD 

The following property of measurements in box world 
is proven in Appendix D of [l|. 
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FIG. 1: An example of a basic measurement on two systems, 
each with x,a £ {0, 1}. 

Theorem 1 All effects in box world can he represented 
using only positive arrays where each component satisfies 

< i?(a|x) < 1. (12) 

From hereon, assume that effects are indeed represented 
this way. 

hi the case of multi-partite systems, we have aheady 
defined product states, and distinguished separable and 
entangled states. Similar definitions apply to effects. A 
multi-partite effect is a product effect if 

i?(a|x) = i?i(ai|a;i)i?2(a2|a;2) ■ • ■ i?„(a„|a;„), 

where Ri{ai\xi) is a valid effect on the ith system. An 
effect is separable if it can be written as a sum of product 
effects, otherwise it is entangled. (Note that the sum 
here really is just a sum - not a convex combination, as 
in the definitions applicable to states.) Any array R with 
one entry e (0, 1] and the rest zero represents a product 
effect. Hence Theorem [T] yields 

Corollary 1 There are no entangled effects in box world. 

Section IVTl shows that Theorem [1] also prevents entangle- 
ment swapping and teleportation in box world. 

A certain class of measurements is particularly simple 
and will play a special role in what follows. 

Definition 1 A measurement is basic if it can be imple- 
mented by a sequence of fiducial measurements on indi- 
vidual subsystems, where later measurement choices may 
depend (deterministically) on earlier outcomes, and the 
final measurement outcome r is given by a deterministic 
function of the fiducial measurement outcomes a. 

An example of a basic measurement for two subsystems 
is given in Figure [TJ where fiducial measurement xi = 
is performed on the first subsystem and then measure- 
ment X2 = ai is performed on the second, and the final 
measurement outcome is given by r = a2. In general, it 
is reasonable to require that measurements of this form 



are in the set of allowed measurements in an operational 
model. In fact, as we can also choose such basic mea- 
surements at random, it is also reasonable to require that 
convex combinations of basic measurements (defined in 
an obvious way) are allowed. Note that for a single sys- 
tem, each basic measurement corresponds to a fiducial 
measurement, possibly with relabelled outputs. 



Theorem 2 All valid measurements on single or bipar- 
tite systems in box world are convex combinations of basic 
measurements. 



This means that any measurement on a single or bipartite 
system can be implemented by a probabilistic protocol 
involving only fiducial measurements. Especially given 
Corollary [U it would be natural to assume that Theo- 
rem [5] generalises to multi-partite systems in box world, 
and indeed this was hypothesized in Ref. [Ij. However, 
things are not so simple. 

Theorem 3 For tri-partite .systems, there are measure- 
ments which do not reduce to a convex combination of 
basic measurements. 



Section IV Bl illustrates this with a specific example. 

Finally, there is at least some limitation on the power 
of measurements in box world, even in the multi-partite 
case. 

Theorem 4 For any single or multi-partite system, all 
allowed measurements can be simulated using fiducial 
measurements and post-selection (i.e., the measurement 
is allowed to sometimes fail). 



V. PROOFS 

Some preliminary remarks will be useful. First, recall 
Theorem [1] which states that the entries of an effect R 
can be assumed non-negative. This is used throughout 
this section. 

Now consider the arrays R^ corresponding to the out- 
comes of a basic measurement. Suppose that a basic 
measurement is carried out, with final outcome r, and 
that during its execution, the fiducial measurements x 
are performed, with outcomes a. In this case, say that 
the triple {r, a, x} is realized. A basic measurement can 
be represented such that the component i?r(a|x) is equal 
to 1 if and only if it is possible for {r, a, x} to be real- 
ized, else it is 0. From hereon we make this choice. For 
example, the bipartite measurement illustrated in Fig. [T] 
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is represented by 



A. Proof of Theorem [2] 



Ro{aia2\xiX2) 



Ri{aia2\xiX2) = 
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(13) 



(14) 



Definition 2 The total measurement array for a mea- 
surement {Rr} is given by 



r 

From Eq. (O, M satisfies 



M • p = 1 yp eV. 



(15) 



(16) 



Now consider the total measurement array correspond- 
ing to a basic measurement. It has a simple form, which 
can be described iteratively. First, since each a corre- 
sponds to a specific r, the component M(a|x) is equal 
to 1 if and only if it is possible for the pair {a, x} to be 
realized. The probability of {a, x} being realized is given 

by 



Pr(a,x) = Af(a|x) x P(a|x). 



(17) 



Now, for a single system, M{a\x) = S^i for some i. For 
a multi-partite system composed of n subsystems, there 
must exist some k, and some i, such that the first step 
in the basic measurement is to perform the fiducial mea- 
surement Xk = i on the fcth subsystem. Let Xk represent 
a sequence of measurements and a^ a sequence of out- 
comes on the remaining n — 1 subsystems. M satisfies 

Af(a|(a;fc ^z)xfe) = 0. (18) 

Define a new {n — l)-dimcnsional array Mq^, such that 

Afajafc|xfe) = M{akaLk\{xk = i)xfe). (19) 

For all Ofe, must correspond to a valid basic mea- 

surement on (n — 1) subsystems. 

Finally, with a suitable choice of deterministic function 
of a for the output r, Af (a|x) (consisting of Os and Is) 
can be decomposed into any sum of arrays Rr{si\x) (also 
consisting of Os and Is). For the measurement of Figure[TJ 
outcomes are represented by Rq and i?i as in Eqs. (jl3p 
and (|14p . and M is given by 



M{aia2\xiX2) 



( ^ ^ 


\ 





1 1 








V 


0/ 



(20) 



Consider a measurement on n systems with outcomes 
{Rf.} and total measurement array M. The measure- 
ment is a convex combination of basic measurements if it 
can be performed by rolling dice, say, and then perform- 
ing one basic measurement or another depending on the 
outcome of the dice roll. In this case it is obvious that 
M is a convex combination of total measurement arrays 
for basic measurements. 

The first step in the proof of Theorem [2] is to note 
that the converse also holds. That is, given M and 
{Rr}, if M can be written as a convex combination of 
total measurement arrays for basic measurements, then 
there is a convex combination of basic measurements with 
the same total measurement array M, and with out- 
comes corresponding to {Rr}. To see this, suppose that 
M = E.ftMi, where < < 1, = 1 and 

is the total measurement array for a basic measurement. 
Construct an appropriate convex combination of basic 
measurements as follows. With probability qi, let the or- 
der of fiducial measurements to perform be that dictated 
by Mi. The probability of measuring x and obtaining 
outputs a is given by Equation (jl7p (which continues to 
hold for convex combinations of basic measurements). In 
the case that {a, x} is realized, announce result r with 
probability Pr(r|a, x) = i?r(a|x)/Af(a|x) (where if {a, x} 
is realized, the right hand side must be > and < 1). 
The overall probability of obtaining outcome r is now 
given by 

Pr(r) = ^Pr(r|a,x)Pr(a,x) (21) 

ax 

= ^i?.(a|x)P(a|x) (22) 

ax 

= R^.P, (23) 

as required. 

In order to prove Theorem [21 it is thus sufficient to 
show that for any measurement on a single or bi-partite 
system in box world, M can be written as a convex com- 
bination of total measurement arrays for basic measure- 
ments. Let a suhnormalised basic measurement array 
have the form M = aM, for < a < 1 and M a total 
measurement array. The strategy is to show that given 
a (non-zero) suhnormalised M, it is always possible to 
subtract a (non-zero) suhnormalised basic measurement 
array to leave a suhnormalised M' = M — with 
at least one additional zero entry. By iteration, we can 
then prove that any normalised M can be built from a 
convex combination of basic measurement vectors. 

Since the total measurement array M is central to the 
analysis of this section, from here on we use the term 
'measurement' to refer both to the measurement itself 
and to M, depending on context. 

We will use variables with subscripts to denote sets 
of numbers, indexed by the value of the subscript. For 
example, refers to a set of a values (one for each value 
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of x). We will use notation such as x* to refer to a 
particular a;-value. 



1. Single-system measurements 

In Ref. [H it is shown that all single-system measure- 
ments in box world are mixtures of fiducial measure- 
ments. We include an alternative proof here to illustrate 
the techniques used in the bi-partite case in a simpler 
setting. 

Consider a (non-zero) subnormalised M associated 
with a single system characterised by any set of fiducial 
measurements. One of the following must hold: 

1. It is possible to subtract a (non-zero) subnor- 
malised basic measurement Mb from M to leave 
a valid subnormalised M' = M — M^. In this case 
there must be a measurement choice x* for which 
M{a\x*) > Va, hence it is possible to subtract 
the subnormalised basic measurement MB{a\x) = 
kSx,x' where k = mma{M{a\x*)). This generates 
at least one additional entry in M'. 

2. It is not possible to subtract a (non-zero) subnor- 
malised basic measurement from M without creat- 
ing a negative component. In this case, there must 
exist a set Ux such that M{ax\x) = Vx. In terms 
of the representation of M as an array, there must 
be at least one zero in each block. 

Case m however, is impossible. Consider the state P 
defined by P{a\x) — Sa,a^- P is clearly an allowed state, 
as it is positive, normalised and non-signalling. However, 
if case 2 holds then 

M.P = J2 M{a\x)P{a\x) = 0. (24) 

ax 

This implies that M.P = for aU states P. But the 
only way to achieve this is to take M = 0, which contra- 
dicts the initial assumption that M is non-zero. There- 
fore case [T] is the only possibility. Subnormalised basic 
measurements can be subtracted from M until the zero 
vector remains. Hence any single-system measurement 
can be expressed as a finite sum of subnormalised basic 
measurements. Since M.P = 1 for any state P and nor- 
malised M, this procedure yields a decomposition of M 
as a convex combination of basic measurements. 



2. Bi-partite system measurements 

Consider a (non-zero) subnormalised M, associated 
with a bipartite system, with components M{ab\xy). (In 
this subsection, fiducial measurements are labelled x, y, 
and outcomes a, 6, rather than xi,X2 and ai,a2 respec- 
tively.) One of the following must hold: 



1. It is possible to subtract a (non-zero) subnor- 
malised basic measurement Mg from M to leave 
a valid subnormalised M' = M — M^ . In this case 
at least one of the following must also be true: 

(a) There exists an x* , and a set j/q, such that 
M{ab\x*ya) > Va, 6. It is possible to sub- 
tract the subnormalised basic measurement 
Msiablxy) = k5x,x*^y,ya from M, where k — 
min(j,b M[ab\x*ya), generating at least one ad- 
ditional element in M'. 

(b) There exists a y* , and a set Xb, such that 
M{ab\xby*) > Va, 6. It is possible to sub- 
tract the subnormalised basic measurement 
Mg^ablxy) = kSy^y-Sx^xt from M, where k = 
min(j,b M {ab\xby*) , generating at least one ad- 
ditional element in M'. 

2. It is not possible to subtract a (non-zero) subnor- 
malised basic measurement from M without cre- 
ating a negative component. In this case there 
exist sets ax, and bxy, such that M{axbxy\xy) = 
Va;,?/, and there exist sets by, and Qxy, such that 
M {axyby\xy) = OW x . In terms of the representa- 
tion of M as an array, this means that each row of 
blocks (corresponding to fixed x) contains a row of 
components (corresponding to fixed x and a — Ux) 
with at least one zero entry in each block (where 
b — bxy). Similarly, each column of blocks contains 
a column of components with at least one zero en- 
try in each block. 

But case [5] is impossible. The example at the end of 
this subsection may be helpful in understanding the var- 
ious stages of the following. 

To derive a contradiction, suppose that case [H holds. 
Consider a value x* , and the corresponding Qx' and bx*y 
for which AI{ax'bx*y\x*y) = Vy. Consider a product 
state P such that P{ax*bx*y\x*y) — 1 Vy, and another 
product state P' obtained from P by swapping the out- 
puts a — ax* with a = a' when x — x* {& local relabelling 
of the outputs). 

From Equation (|16p and the definition of a subnor- 
malised total measurement array, it is clear that 

M.(P-P') = ^ M{ab\xy){P{ab\xy)--P\ab\xy)) = 0. 

(25) 

This implies that 

Y,{M(ax'bx*y\x*y) - M{a'bx'y\x*y)) = 0, (26) 
y 

hence M(a'bx*y\x*y) — Vy. By modifying the cho- 
sen parameters x* and a', this generalises to the result 
that M{abxy\xy) = ya,x,y. Similarly, M{axyb\xy) — 
yb,x,y. In terms of the array, each subarray, corre- 
sponding to particular values of x and y, contains both 
a row and a column of zeroes. 
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But now consider an arbitrary element M{a*b*\x*y*). 
Let P be a bipartite product state with P{a*b*\x*y*) = 1 
and P{a*bx*y\x*y) — \ \l y ^ y*, and P' a bipartite 
product state obtained from P by swapping a = a* 
with a — ax*y when x — x* . From Equation psp. 
M{a*b*\x*y*) = 0. This contradicts the initial assump- 
tion that M is nonzero. Hence case [5] is impossible, and 
either case[Ta|or case I Ibl must hold. Iteratively subtract- 
ing subnormalised basic measurements from M until the 
zero vector remains yields an expression for M as a con- 
vex combination of basic measurements. 

We now illustrate the above proof for an example bi- 
partite measurement, where each subsystem is charac- 
terised by two fiducial measurements each with three out- 
puts. Consider a situation in which case[2]holds (it is not 
possible to remove a basic measurement), and the repre- 
sentation of M as an array contains zeroes as follows 
(with all other elements unknown): 



(27) 



. . 


. 1 . \ 


1 • • 


. . 


1 - 





1 • • 




• 1 • 

1 - 


1 • • 


. . / 



Here, the outer matrix corresponds to the measure- 
ment choice (x vertically, y horizontally), and the inner 
submatrices correspond to the outcomes (a vertically, b 
horizontally). By comparing pairs of product states as 
above, deduce that each line of zeroes can be extended 
perpendicularly, yielding a row and column of zeroes in 
each submatrix: 



/ 
. . 
. . 



. . 

V . . 



. . \ 


. . 



. . 
. . 
/ 



(28) 



It then follows that all of the unknown elements must 
also be zero. For example, in order to show that 
M(10|00) = (the element below the top left element), 
consider the states P and P' as follows (non-zero ele- 
ments of P are shown boxed and non-zero elements of P' 
are shown in bold): 



1 














1 


m 



































ill 














ill 



































, 



(29) 



B . Proof of Theorem [3] 

We prove Theorem [3] with an explicit counterexample - 
that is, a joint measurement on three subsystems, which 
cannot be considered as a convex combination of basic 
measurements. The subsystems are characterised by two 
binary-outcome fiducial measurements each. The joint 
measurement has 8 possible outcomes, Rq, . . . , R7, with 

i?o(001|000) = i?i(110|000) = 1 (30) 

i?2(000|100) = i?3(100|100) = 1 (31) 

i?4(101|010) = i?5(lll|010) = 1 (32) 

i?6(010|001) = i?7(011|001) = 1, (33) 

and all other components 0. This measurement is rather 
subtle. Note that each individual effect here is in fact a 
product effect, in keeping with Corollary [1] which states 
that there are no entangled effects. Nevertheless the mea- 
surement as a whole cannot be realised as a basic mea- 
surement or convex combination thereof. 

To see that this cannot be constructed from basic mea- 
surements, consider the total measurement array M, il- 
lustrated in Figure H Note that element A/(001|000) is 
surrounded by 3 zeroes in the block with x — y ^ z = 
(i.e. M(101|000) = M(011|000) = M(000|000) = 0). 
As the total measurement array for a basic measurement 
is insensitive to the output of the last subsystem mea- 
sured, it consists of lines of Is inside blocks. A basic 
measurement cannot therefore have non-zero Af(001|000) 
without also having nonzero M(101|000), Af (011|000) or 
Af(000|000). Given that all the convex coefficients and 
basic measurements must be positive, it is therefore im- 
possible to construct M(abc\xyz) from convex combina- 
tions of basic measurements. Since each non-zero element 
of M{abc\xyz) corresponds to a different measurement 
outcome, there are no equivalent vectors which imple- 
ment the same measurement, hence this measurement is 
impossible to simulate by any convex combination of ba- 
sic measurements. 

It remains only to show that {R^} does indeed repre- 
sent a valid measurement. The condition that R^.P > 
for all valid P is ensured by the positivity of the entries 
of Rr. In addition, outcome probabilities should sum to 
1, meaning that 

^R^.P = M.P= M{abc\xyz)P{abc\xyz) = 1, 

(34) 

for all states P satisfying the positivity, normalisation 
and the no-signalling conditions. To see that this is in- 
deed the case, evaluate the sum in Equation ([M)) to ob- 
tain 



MP 



P(001|000) + P(110|000) 
+P(000|100) + P(100|100) 
-hP(101|010)-l-P(lll|010) 
-hP(010|001) + P(011|001) 



(35) 
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and 'failure' is declared with probability 



FIG. 2: This diagram shows the three-dimensional array cor- 
responding to the total measurement array of a non-basic tri- 
partite measurement. The shaded and unshaded entries have 
values 1 and respectively. Note that the shaded 2x1x1 rect- 
angles can each be slid parallel to their long edge to complete 
the 2x2x2 cube in the front upper left of the array (corre- 
sponding to performing fiducial measurements x=0 on each 
subsystem). The invariance of the total measurement ar- 
ray under such transformations is a consequence of the no- 
signalling conditions. 



The no-signalling conditions ensure that 

P(000|100) + P(100|100) = P(000|000) + P(100|000) 
P(101|010) + P(lll|010) = P(101|000) + P(lll|000) 
P(010|001) + P(011|001) = P(010|000) + P(011|000) 

Substituting in Equation (1351) . and using the normalisa- 
tion of P, 



M • P = ^ P(a6c|000) = 1. 

a,6,c 



(36) 



Hence {Rr} represents a valid tri-partite measurement 
that cannot be simulated by a convex combination of 
basic measurements. 



C. Proof of Theorem H 

Although joint measurements on three or more subsys- 
tems in box viforld cannot generally be implemented using 
fiducial measurements, Theorem 2] implies that they are 
still in some sense simple, as they can be simulated using 
local fiducial measurements and post-selection. 

To simulate a general measurement described by {Rr}, 
first perform a random fiducial measurement x on the 
complete system (composed of a random fiducial mea- 
surement xi,X2, ■ ■ ■ ,xn on each subsystem), in which 
each X occurs with constant probability q. If result a 
is obtained in the fiducial measurement, then the general 
measurement outcome r is given with probability 



Pr(r|ax) 



Pr(a|x) 



Pr(fail|ax) = l-^Pr(r|ax)) 



= 1 - 



M(a|x) 



maxax Af(a|x) 



(38) 



Note that due to the way they are constructed, and the 
fact that Rr{a\x.) > 0, these probabilities are all positive, 
and the total probability for declaring failure or some 
output is one. 

The probability of obtaining output r given that the 
simulated measurement succeeds is then given by 



Pr(r|success) 



Eaxg-P(a|x)Pr(r|ax) 
E.'Eax9^(a|x)Pr(r'|ax) 

VP.(a|x)P(a|x) 



(39) 



maxax M (a|x) 



(37) 



which is exactly what one would expect from a perfect 
implementation of the measurement. We have therefore 
proved Theorem [4l 

The same approach docs not apply to quantum theory 
because there Rr{a.\x.) can contain negative components. 
In fact, the theorem does not hold for quantum theory. 
Bell measurements, for example, cannot be simulated by 
local Pauli measurements and post-selection. 



VI. CONSEQUENCES FOR INFORMATION 
PROCESSING 



One reason for investigating the available measure- 
ments in box world is that we can draw conclusions about 
information processing in box world, and contrast this 
with information processing in quantum theory. The 
facts that there are no entangled effects in box world, 
that measurements are limited to probabilistic mixtures 
of basic measurements (for single and bipartite systems) , 
and can be simulated with postselection (for any system) 
imply that information processing in box world is in some 
ways rather limited. This is despite the fact that box 
world allows highly entangled states, which can exhibit 
strong nonlocality in violation of Tsirelson's bound. 

Consider first entanglement swapping [T2j . In quantum 
theory, the simplest example of entanglement swapping 
is as follows. Alice and Bob share two quantum systems 
in a singlet state |i/'-)aSi, and Bob and Charlie share 
two more systems, also in a singlet state \iP-)b2C- Bob 
performs a Bell basis measurement on systems Bi and B2 
and announces the outcome. Alice's and Charlie's sys- 
tems will now be in a maximally entangled state (where 
which entangled state they share depends on Bob's out- 
come). Refs. Pi and 11 ] both offer proofs that an analo- 
gous procedure is impossible in box world in the special 
case that each system is characterised by two binary- 
output fiducial measurements. Here we show that this 
result is general. 
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Theorem 5 In box world, there is no entanglement 
swapping. In particular, suppose that Alice shares with 
Bob any number of systems in a joint state P, and Bob 
shares with Charlie any number of systems in a joint state 
Q and that the initial joint state of all systems is a direct 
product of P and Q . Then there is no measurement that 
Bob can perform on his systems that will, for some out- 
come, result in an entangled state shared between Alice 
and Charlie. 

Proof. Let the systems held by the parties be denoted 
A, Bi , B2 and C, such that P is the state of A and Bi and 
Q is the state of B2 and C. The proof is general enough 
that any of these may themselves be composite systems. 
Let Pbiyi be the collapsed state of the A system, con- 
ditioned on fiducial measurement yi being performed on 
the Bi system with outcome bi. The collapsed state is 
defined such that its components satisfy 



-Pbiyi(a|x) = 



P(abi|xyi) 
Ea^(abilxyi) 



(40) 



is 



If Q is a joint state of systems B2 and C, then Qb2y2 
a collapsed state of the C system, defined similarly. 

Suppose that Bob makes a joint measurement on all 
of his subsystems, with an outcome r, and that Alice 
and Charlie perform fiducial measurements x and z with 
outcomes a and c. It is quite easy to show that the 
probability of getting outcomes r, a, c is given by 

Pr(rac|xz)= ^ i?.(bib2|yiy2)/'(abi|xyi)g(b2c|y2z). 

(41) 



biyi 

b2y2 



Equation (|1T|) can be reexpressed as 



Pr(rac|xz) 



with 



J2 C'bib2yiy2^biyi(a|x)Qb2y2(c|z) 
bib2yiy2 



(42) 



^bib2yxy2 =^-(bib2|yiy2)P(bi|yi)g(b2|y2). (43) 

The collapsed state of the AC system, given outcome 
r for Bob's measurement, satisfies 

Pr(rac|xz) Pr(rac|xz) , , 

PJac xz) = — \ ' / ^ ^ — ^ ' / , 44) 
^ ' ' Pr(r|xz) 2^^Pr(rac|xz) ^ ' 

from which it follows that 

P^(ac|xz) = •^bib2yiy2-fbiyi(a|x)Qb2y2(c|z) 
bib2yiy2 

(45) 

where 



XL 



bib2yiy2 



bib2yiy2 



bib2yiy2 bib2yi} 



(46) 



Due to the positivity of i?r(bib2|yiy2) (Theorem [T]), 
the '^bib2yiy2 positive. Note also that 



Sbib2yiy2 ^bib2yiy2 = l' ^hus Equation gS]) repre- 
sents a separable state for Alice and Charlie. Hence 
Bob's measurement cannot introduce entanglement be- 
tween Alice and Charlie, whatever the result. □ 

Corollary 2 In box world, states cannot be teleported. 

This is immediate given the impossibility of swapping en- 
tanglement. If teleportation were possible in box world, 
then it would be possible to achieve entanglement swap- 
ping by teleporting one half of an entangled state. 

Note that in [20, HH, it is shown that entanglement 
swapping is possible in an alternate theory within the 
probabilistic framework, that has a smaller state space 
than box world [1^. This is a good illustration of the 
trade off between states and measurements in probabilis- 
tic theories. 

Our final theorem concerns dense coding. In quan- 
tum theory, dense coding allows Alice to send two bits of 
classical information to Bob via the transmission of only 
one qubit, provided that they intially share an entangled 
state [131 . This contrasts the fact that if no entanglement 
is shared, a single qubit can only be used to transmit one 
bit of classical information [23|. The procedure is as fol- 
lows. Suppose that Alice and Bob share two qubits in 
a singlet state \iI)-)ab- Alice now performs one of four 
possible unitary transformations, /, Ux, <Jy, on her 
qubit, depending on the two classical bits she wishes to 
send. She sends the qubit to Bob, who, with the two 
qubits in his possession, performs a Bell basis measure- 
ment. The outcome of this measurement tells him with 
certainty which transformation Alice performed. 

Theorem 6 In box world, there is no dense coding. 

Proof. Suppose that Alice and Bob initially share a 
bipartite system in a joint state P. In a dense coding 
protocol, Alice would perform a transformation T on her 
system, where T depends on the message she wishes to 
send. Recalling Equation [S] for single systems, the effect 
of Alice's transformation on the global state is P P' 
where 

P\a'b\x'y) = J2 T(a'\x\ a\x)P{ab\xy). (47) 



Alice sends her system to Bob, who performs a measure- 
ment on the bipartite system. However, by Theorem [5J 
Bob's 

measurement is a convex combination of basic mea- 
surements. If Bob is to learn the message with certainty, 
random choices cannot help, so assume his measurement 
is a basic measurement. There are two cases: either the 
basic measurement begins with a fiducial measurement 
on system A or it begins with a fiducial measurement 
on system B. In the second case. Bob could equally 
have performed the fiducial measurement on system B 
before Alice sends system A, indeed before she performs 
her transformation. Hence the protocol is equivalent to 
a protocol with no entanglement, in which Alice simply 
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begins with a single system, performs a transformation 
and sends it to Bob. The amount of classical information 
transmitted can be no more than the sending of a single 
box allows. In the first case, the proof is slightly more in- 
volved. Consider the fiducial measurement on system A 
that Bob is to perform. In an equivalent protocol, Alice 
performs this measurement herself, just after her trans- 
formation, and then sends to Bob the classical outcome of 
the measurement, instead of system A. Let this measure- 
ment have d possible outcomes. In any protocol in which 
the only transmission from Alice to Bob is a number from 
1 to d, it is impossible for Bob to distinguish > d mes- 
sages, even if there is pre-shared entanglement. [2^ But 
even without pre-shared entanglement, the transmission 
of system A would suffice for Bob to distinguish d possi- 
ble messages. Alice simply encodes the message into the 
outcome of the d-outcome fiducial measurement. Hence 
the dense coding protocol confers no advantage. □ 

VII. CONCLUSIONS 

Box world is not classical, nor quantum, but it has a 
natural and easy definition in terms of an operational 
framework. Of course it does not describe our universe. 
So why bother with it? Simply for the sake of compari- 
son with quantum theory. In particular, it is interesting 
to explore the information processing possibilities of box 
world and to compare these with the possibilities in quan- 
tum theory. There are ways in which box worlders are 
better off than the inhabitants of a quantum universe: as 
van Dam showed they find communication complex- 
ity problems trivial. But in other ways, the box worlders 



are worse off: the results of this paper imply that they 
cannot do entanglement swapping, teleportation or dense 
coding. 

These differing powers can be traced back to a trade- 
off that exists between the states which a theory allows 
and the measurements it allows. Box world is permis- 
sive with respect to states - all nonlocal correlations can 
be realized. But this forces a paucity of measurements 
which means that the theory is very restricted in other 
ways. Quantum theory is actually remarkable in the bal- 
ance it achieves between the two, yielding potent nonlo- 
cal correlations in addition to a broad range of possible 
measurements and dynamics. This leads us to speculate 
that quantum theory is, in at least some ways, optimal. 

Finally, one issue that we have not raised is that of 
computation. It is possible to define a circuit model for 
computation in box world, similar to the classical and 
quantum circuit models [l[. Would a box computer be 
even more powerful than a quantum computer? Alterna- 
tively, in view of the restricted dynamics, would it per- 
haps be no better than a classical computer? 
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